1 Introduction


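For a series $\sum_i t_i g_i$ of independent standard normal r.v.'s, where $\sum_i t_i^2 < \infty$ and $g_i \in \mathcal{N}(0,1)$, the following tail estimate is well known:

$$\Pr\Big(\sum_i t_i g_i > \alpha\Big) \le \exp\Big(-\frac{\alpha^2}{2\sum_i t_i^2}\Big);$$

see for instance [6]. In Section 2 we present a derivation of this estimate and show a variational formula for the function $\frac{\alpha^2}{2\sum_i t_i^2}$. It will turn out that it is given by

$$\frac{\alpha^2}{2\sum_i t_i^2} = \inf_{\sum_i t_i b_i = \alpha} \frac{1}{2}\sum_i b_i^2.$$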


In Section 3 we prove variational formulas for a wider class of series of independent
random variables. Let us stress that they mostly do not possess simple forms as in the
above Gaussian case.
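The Gaussian tail estimate stated at the beginning of this section can be sanity-checked numerically for a finite weight sequence. The following is a minimal sketch, not part of the original argument: the function names and the sample weights are hypothetical, and the exact tail is computed from the fact that a finite sum $\sum_i t_i g_i$ is normal with variance $\sum_i t_i^2$.

```python
import math

def gaussian_series_tail(alpha, t):
    """Exact Pr(sum_i t_i g_i > alpha) for finitely many weights t."""
    sigma = math.sqrt(sum(ti * ti for ti in t))
    return 0.5 * math.erfc(alpha / (sigma * math.sqrt(2.0)))

def exponential_bound(alpha, t):
    """The bound exp(-alpha^2 / (2 * sum_i t_i^2))."""
    return math.exp(-alpha * alpha / (2.0 * sum(ti * ti for ti in t)))

weights = [1.0, 0.5, 0.25]  # hypothetical finite weight sequence
for alpha in (0.0, 0.5, 1.0, 2.0, 4.0):
    # the exact tail never exceeds the exponential bound for alpha >= 0
    assert gaussian_series_tail(alpha, weights) <= exponential_bound(alpha, weights)
```

The check covers only nonnegative $\alpha$ and finitely many weights; it illustrates the estimate and does not replace the derivation in Section 2.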

The function $\frac{\alpha^2}{2\sum_i t_i^2}$ is the Cramér transform of the random series $\sum_i t_i g_i$. The Cramér
transform is the Legendre-Fenchel transform of the cumulant generating function of
a r.v. For our purposes we will need the general notion of the Legendre-Fenchel
transform in topological spaces (see [3] or [1]). Let $V$ be a real locally convex Hausdorff
space and $V^*$ its dual space. By $\langle\cdot,\cdot\rangle$ we denote the canonical pairing between $V$ and
$V^*$. Let $f : V \to \mathbb{R}\cup\{\infty\}$ be a function not identically $\infty$. By $\mathcal{D}(f)$ we denote the
effective domain of $f$, i.e. $\mathcal{D}(f) = \{u \in V : f(u) < \infty\}$. A function $f^* : V^* \to \mathbb{R}\cup\{\infty\}$
defined by

$$f^*(u^*) = \sup_{u\in V}\{\langle u,u^*\rangle - f(u)\} = \sup_{u\in\mathcal{D}(f)}\{\langle u,u^*\rangle - f(u)\} \qquad (u^*\in V^*)$$

is called the Legendre-Fenchel transform (convex conjugate) of $f$, and a function
$f^{**} : V \to \mathbb{R}\cup\{\infty\}$ defined by

$$f^{**}(u) = \sup_{u^*\in V^*}\{\langle u,u^*\rangle - f^*(u^*)\} = \sup_{u^*\in\mathcal{D}(f^*)}\{\langle u,u^*\rangle - f^*(u^*)\} \qquad (u\in V)$$

is called the convex biconjugate of $f$.

The functions $f^*$ and $f^{**}$ are convex and lower semicontinuous in the weak* and
weak topology on $V^*$ and $V$, respectively. Moreover, the biconjugate theorem states
that a function $f : V \to \mathbb{R}\cup\{\infty\}$ not identically equal to $+\infty$ is convex and lower
semicontinuous if and only if $f = f^{**}$.

Let us mention additional properties of convex conjugates; see 4.3 Examples in
[3]. Let $V$ be a normed space. We denote by $\|\cdot\|$ the norm of $V$ and by $\|\cdot\|_*$ the norm
of $V^*$. For conjugate exponents $p,q \in (1,\infty)$ ($\frac{1}{p} + \frac{1}{q} = 1$), the function $\frac{1}{q}\|u^*\|_*^q$ is the
convex conjugate of $\frac{1}{p}\|u\|^p$.

Remark 1.1. Let us emphasize that in Hilbert spaces the function $\frac{1}{2}\|u\|^2$ can be treated
as a function invariant with respect to the Legendre-Fenchel transform.
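The conjugacy of $\frac{1}{p}\|u\|^p$ and $\frac{1}{q}\|u^*\|_*^q$, together with the invariance of $\frac{1}{2}\|u\|^2$ from Remark 1.1, can be illustrated on the real line. The sketch below is only a numerical illustration under stated assumptions: $V = \mathbb{R}$, the supremum is approximated by a grid search on a bounded interval, and the helper name `conjugate_on_grid` is hypothetical.

```python
def conjugate_on_grid(f, v, lo=-10.0, hi=10.0, steps=20001):
    """Approximate f*(v) = sup_u {u*v - f(u)} by a grid search on [lo, hi]."""
    h = (hi - lo) / (steps - 1)
    return max((lo + k * h) * v - f(lo + k * h) for k in range(steps))

for p in (2.0, 3.0):
    q = p / (p - 1.0)  # conjugate exponent: 1/p + 1/q = 1
    f = lambda u, p=p: abs(u) ** p / p
    for v in (-2.0, -0.5, 1.0, 3.0):
        approx = conjugate_on_grid(f, v)
        exact = abs(v) ** q / q
        # the grid value agrees with |v|^q / q up to discretization error
        assert abs(approx - exact) < 1e-5
```

For $p = 2$ the loop checks that $\frac{1}{2}u^2$ is its own conjugate; the interval $[-10, 10]$ is wide enough to contain the maximizer for the sample values of $v$ used here.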

Let us list further properties and one lemma. Convex conjugation is order-reversing:

$$\text{if } f \le g \text{ then } f^* \ge g^* \tag{1}$$

and

$$\text{if } g(u) = f(au), \text{ where } a \ne 0, \text{ then } g^*(u^*) = f^*\Big(\frac{u^*}{a}\Big). \tag{2}$$

Let $V$ and $W$ be normed spaces. The dual space $(V \times W)^*$ of their product is
isomorphic to the direct sum $V^* \oplus W^*$ of their dual spaces in the sense that the canonical
pairing between $V \times W$ and $(V \times W)^*$ is given by the sum $\langle u, u^*\rangle + \langle v, v^*\rangle$, where $u \in V$,
$v \in W$, $u^* \in V^*$ and $v^* \in W^*$.

Lemma 1.2. Let $V$ and $W$ be normed spaces and let $f : V \to \mathbb{R}\cup\{\infty\}$ and
$g : W \to \mathbb{R}\cup\{\infty\}$ be functions. The convex conjugate of the function $\varphi(u,v) = f(u) + g(v)$ defined
on $V \times W$ equals

$$\varphi^*(u^*,v^*) = f^*(u^*) + g^*(v^*).$$

Proof. The convex conjugate of $\varphi$ equals

$$\begin{aligned}
\varphi^*(u^*,v^*) &= \sup_{(u,v)\in V\times W}\{\langle u,u^*\rangle + \langle v,v^*\rangle - f(u) - g(v)\} \\
&= \sup_{u\in V}\sup_{v\in W}\{\langle u,u^*\rangle - f(u) + \langle v,v^*\rangle - g(v)\} \\
&= \sup_{u\in V}\{\langle u,u^*\rangle - f(u)\} + \sup_{v\in W}\{\langle v,v^*\rangle - g(v)\} \\
&= f^*(u^*) + g^*(v^*).
\end{aligned}$$

□


2 A model example 


The moment generating function of a standard normal r.v. $g$ equals $\mathbb{E}e^{sg} = e^{s^2/2}$ and
its cumulant generating function is

$$\psi_g(s) := \ln \mathbb{E}e^{sg} = \frac{s^2}{2}.$$

The function $\frac{s^2}{2}$ is invariant with respect to the Legendre transform (see Remark 1.1), that
is, the Cramér transform of $g$ equals

$$\psi_g^*(\alpha) = \frac{\alpha^2}{2}.$$


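Let $I \subset \mathbb{N}$ and let $(g_i)_{i\in I}$ be a sequence of independent standard normal r.v.'s. For
$\mathbf{t} = (t_i)_{i\in I} \in \ell^2(I) = \ell^2$ consider $X_{\mathbf{t}} = \sum_{i\in I} t_i g_i$ (convergence in $L^2$ and a.s.). Observe
that $\mathbb{E}X_{\mathbf{t}}^2 = \sum_{i\in I} t_i^2 = \|\mathbf{t}\|^2$. Thus the subset $\{X_{\mathbf{t}} : \mathbf{t} \in \ell^2\}$ of $L^2$ is isomorphic to $\ell^2$.

The cumulant generating function of $X_{\mathbf{t}}$ will be denoted by $\psi_{\mathbf{t}}$, i.e. $\psi_{\mathbf{t}} = \psi_{X_{\mathbf{t}}}$. Notice
that for fixed $s$ we can consider $\psi_{\mathbf{t}}(s)$ as a functional of the variable $\mathbf{t}$ in $\ell^2$. We will
denote it by $\psi^s$. Let us emphasize that $\psi^s(\mathbf{t}) = \psi_{\mathbf{t}}(s)$.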

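Let $I_N$ denote the set $\{1,2,\dots,N\} \cap I$. For $\mathbf{t} \in \ell^2(I)$ let $\mathbf{t}^N$ be defined as follows:
$t^N_i = t_i$ for $i \in I_N$ and $t^N_i = 0$ for $i \in I \setminus I_N$. Observe that $\mathbf{t}^N \to \mathbf{t}$ in $\ell^2$ and

$$\psi^s(\mathbf{t}^N) = \frac{s^2}{2}\|\mathbf{t}^N\|^2 \to \frac{s^2}{2}\|\mathbf{t}\|^2$$

as $N \to \infty$. For each $s \in \mathbb{R}$ the function $\psi^s$ is continuous on $\ell^2$. It follows that
$\psi^s(\mathbf{t}) = \frac{s^2}{2}\|\mathbf{t}\|^2$. The function $\frac{1}{2}\|\mathbf{t}\|^2$ is invariant with respect to the convex conjugate
on $\ell^2$ (see Remark 1.1). Taking in (2) $f(\mathbf{t}) = \frac{1}{2}\|\mathbf{t}\|^2$ and $a = s$ we have $\psi^s(\mathbf{t}) = f(s\mathbf{t})$
and get

$$(\psi^s)^*(\mathbf{a}) = f^*\Big(\frac{\mathbf{a}}{s}\Big) = \frac{1}{2}\Big\|\frac{\mathbf{a}}{s}\Big\|^2$$

for $s \ne 0$. The function $\psi^s(\mathbf{t}) = \frac{s^2}{2}\|\mathbf{t}\|^2$ is convex and continuous on $\ell^2$. By the
biconjugate theorem we have

$$\psi^s(\mathbf{t}) = \sup_{\mathbf{a}\in\ell^2}\{\langle\mathbf{t},\mathbf{a}\rangle - (\psi^s)^*(\mathbf{a})\} = \sup_{\mathbf{a}\in\ell^2}\Big\{\langle\mathbf{t},\mathbf{a}\rangle - \frac{1}{2}\Big\|\frac{\mathbf{a}}{s}\Big\|^2\Big\}.$$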

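Substituting $\mathbf{a} = s\mathbf{b}$ into the right-hand side we may rewrite the above as follows:

$$\psi^s(\mathbf{t}) = \sup_{\mathbf{b}\in\ell^2}\Big\{s\langle\mathbf{t},\mathbf{b}\rangle - \frac{1}{2}\|\mathbf{b}\|^2\Big\}.$$

If we split the supremum into two parts, over $\mathbb{R}$ and over the hyperplanes $\{\mathbf{b} \in \ell^2 : \langle\mathbf{b},\mathbf{t}\rangle = \text{constant}\}$, then we get

$$\psi_{\mathbf{t}}(s) = \sup_{\alpha\in\mathbb{R}}\ \sup_{\mathbf{b}\in\ell^2:\ \langle\mathbf{t},\mathbf{b}\rangle=\alpha}\Big\{s\langle\mathbf{t},\mathbf{b}\rangle - \frac{1}{2}\|\mathbf{b}\|^2\Big\} = \sup_{\alpha\in\mathbb{R}}\Big\{s\alpha - \inf_{\mathbf{b}\in\ell^2:\ \langle\mathbf{t},\mathbf{b}\rangle=\alpha}\frac{1}{2}\|\mathbf{b}\|^2\Big\}. \tag{3}$$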
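We should remember that $\psi^s(\mathbf{t}) = \psi_{\mathbf{t}}(s)$, that is, $\psi_{\mathbf{t}}(s) = \frac{1}{2}(\|\mathbf{t}\|s)^2$. Using (2) again,
but now for $f(x) = \frac{1}{2}x^2$ and $a = \|\mathbf{t}\|$, we obtain that

$$\psi_{\mathbf{t}}^*(\alpha) = \frac{\alpha^2}{2\|\mathbf{t}\|^2}.$$

This gives the above-mentioned tail estimate for the separable Gaussian process given by
the series $\sum_i t_i g_i$.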

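The function $\psi_{\mathbf{t}}(s) = \frac{1}{2}(\|\mathbf{t}\|s)^2$ is convex and continuous on $\mathbb{R}$. By the biconjugate
theorem we have that

$$\psi_{\mathbf{t}}(s) = \sup_{\alpha\in\mathbb{R}}\{s\alpha - \psi_{\mathbf{t}}^*(\alpha)\}.$$

Comparing (3) and

$$\psi_{\mathbf{t}}(s) = \sup_{\alpha\in\mathbb{R}}\Big\{s\alpha - \frac{\alpha^2}{2\|\mathbf{t}\|^2}\Big\}$$

we see (for the general case see Theorem 3.3) that

$$\frac{\alpha^2}{2\|\mathbf{t}\|^2} = \inf_{\mathbf{b}\in\ell^2:\ \langle\mathbf{t},\mathbf{b}\rangle=\alpha}\frac{1}{2}\|\mathbf{b}\|^2, \tag{4}$$

which we could also check by using the Lagrange multipliers technique.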
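The Lagrange multipliers check mentioned for formula (4) can be sketched numerically in finite dimension. The stationarity condition gives the candidate minimizer $\mathbf{b}^* = \alpha\,\mathbf{t}/\|\mathbf{t}\|^2$; the code below is a hedged illustration with hypothetical helper names and sample data, which verifies that $\mathbf{b}^*$ is feasible, attains $\frac{\alpha^2}{2\|\mathbf{t}\|^2}$, and is not beaten by randomly generated feasible points.

```python
import random

def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

def half_norm_sq(b):
    return 0.5 * dot(b, b)

def constrained_minimizer(t, alpha):
    """Candidate from the Lagrange condition b = lambda * t with <t, b> = alpha."""
    return [alpha * ti / dot(t, t) for ti in t]

t = [1.0, 0.5, 0.25, 0.125]  # hypothetical finitely supported weights
alpha = 1.5
b_star = constrained_minimizer(t, alpha)
value = half_norm_sq(b_star)
assert abs(dot(t, b_star) - alpha) < 1e-12
assert abs(value - alpha ** 2 / (2.0 * dot(t, t))) < 1e-12

rng = random.Random(0)
for _ in range(1000):
    d = [rng.uniform(-1.0, 1.0) for _ in t]
    c = dot(t, d) / dot(t, t)
    d = [di - c * ti for di, ti in zip(d, t)]  # project d onto the hyperplane <t, d> = 0
    b = [bi + di for bi, di in zip(b_star, d)]  # another feasible point
    assert abs(dot(t, b) - alpha) < 1e-9
    assert half_norm_sq(b) >= value - 1e-12
```

Since every feasible point has the form $\mathbf{b}^* + \mathbf{d}$ with $\langle\mathbf{t},\mathbf{d}\rangle = 0$, the random search is only a spot check of the orthogonal decomposition $\|\mathbf{b}\|^2 = \|\mathbf{b}^*\|^2 + \|\mathbf{d}\|^2$.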
3 Main Theorem


The cumulant generating function $\psi_X(s) = \ln \mathbb{E}e^{sX}$ of any random variable $X$ is convex
and lower semicontinuous on $\mathbb{R}$ (analytic on $\operatorname{int}\mathcal{D}(\psi_X)$). It maps $\mathbb{R}$ into $\mathbb{R}\cup\{\infty\}$ and
takes value zero at zero, but it is possible that $\psi_X(s) = \infty$ when $s \ne 0$. It is reasonable
to assume that it is finite on some neighborhood of zero, i.e. $X$ satisfies the condition:
there exists $\lambda > 0$ such that $\mathbb{E}e^{\lambda|X|} < \infty$. Let us emphasize that if $\mathbb{E}X = 0$ then $\psi_X \ge 0$, but the Cramér
transform $\psi_X^*$ is always nonnegative and attains $0$ at the value $\mathbb{E}X$.

Before proving our main theorem, we show forms of the cumulant generating function
and the Cramér transform of series of independent random variables.

Proposition 3.1. Let $(X_i)_{i\in I}$ be a sequence of zero-mean independent random variables
with common bounded second moments. For each $i \in I$ let the cumulant generating
function $\psi_i(s) := \psi_{X_i}(s) = \ln \mathbb{E}e^{sX_i}$ be finite on some neighborhood of zero. Then
for each $\mathbf{t} = (t_i)_{i\in I} \in \ell^2$ the cumulant generating function of the random series $X_{\mathbf{t}} = \sum_{i\in I} t_i X_i$
is given by the following equality:

$$\psi_{\mathbf{t}}(s) := \psi_{X_{\mathbf{t}}}(s) = \sum_{i\in I}\psi_i(st_i).$$

Proof. Because the $X_i$ are independent, centered and have common bounded second
moments, for every $\mathbf{t} \in \ell^2$ the series $X_{\mathbf{t}} = \sum_{i\in I} t_i X_i$ converges in $L^2$ and a.s. Let
us emphasize that the convergence of the series $X_{\mathbf{t}}$ in $L^2$ is equivalent to the convergence
of the sequences $\mathbf{t}$ in $\ell^2$. For fixed $s$ we can consider the cumulant generating function $\psi_{\mathbf{t}}(s)$
as a functional of the variable $\mathbf{t}$ in $\ell^2$. We will denote it by $\psi^s$, i.e. $\psi^s(\mathbf{t}) = \psi_{\mathbf{t}}(s)$.

We show that for every $s \in \mathbb{R}$ the functional $\psi^s$ is convex and lower semicontinuous on
$\ell^2$.

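Convexity may be checked by using the Hölder inequality. Let $\mathbf{t}, \mathbf{u} \in \ell^2$ and $\lambda \in (0,1)$;
then

$$\psi^s(\lambda\mathbf{t} + (1-\lambda)\mathbf{u}) = \ln \mathbb{E}e^{sX_{\lambda\mathbf{t}+(1-\lambda)\mathbf{u}}} = \ln \mathbb{E}e^{s(\lambda X_{\mathbf{t}} + (1-\lambda)X_{\mathbf{u}})} = \ln \mathbb{E}\big[(e^{sX_{\mathbf{t}}})^{\lambda}(e^{sX_{\mathbf{u}}})^{1-\lambda}\big].$$

By the Hölder inequality for exponents $1/\lambda$ and $1/(1-\lambda)$ we get

$$\mathbb{E}\big[(e^{sX_{\mathbf{t}}})^{\lambda}(e^{sX_{\mathbf{u}}})^{1-\lambda}\big] \le \big(\mathbb{E}e^{sX_{\mathbf{t}}}\big)^{\lambda}\big(\mathbb{E}e^{sX_{\mathbf{u}}}\big)^{1-\lambda}$$

and, in consequence,

$$\psi^s(\lambda\mathbf{t} + (1-\lambda)\mathbf{u}) \le \lambda\ln\mathbb{E}e^{sX_{\mathbf{t}}} + (1-\lambda)\ln\mathbb{E}e^{sX_{\mathbf{u}}} = \lambda\psi^s(\mathbf{t}) + (1-\lambda)\psi^s(\mathbf{u}).$$

Lower semicontinuity follows from Fatou's lemma. Let $\mathbf{t}^n \to \mathbf{t}^0$ in $\ell^2$. Note that
$X_{\mathbf{t}^n}$ converges a.s. to $X_{\mathbf{t}^0}$. Then

$$\liminf_{n\to\infty}\psi^s(\mathbf{t}^n) = \liminf_{n\to\infty}\ln\mathbb{E}e^{sX_{\mathbf{t}^n}} \ge \ln\mathbb{E}\big(\liminf_{n\to\infty}e^{sX_{\mathbf{t}^n}}\big) = \ln\mathbb{E}\big(e^{sX_{\mathbf{t}^0}}\big) = \psi^s(\mathbf{t}^0).$$

It means that $\psi^s$ is lower semicontinuous on $\ell^2$.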

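Let $\ell_0$ denote the space of sequences with finite supports. Observe that $\ell_0$ is a dense
subset of $\ell^2$. For $\mathbf{t} \in \ell_0$ we have

$$\psi^s(\mathbf{t}) = \ln\mathbb{E}e^{s\sum_{i\in I}t_iX_i} = \ln\prod_{i\in I}\mathbb{E}e^{st_iX_i} = \sum_{i\in I}\psi_i(st_i).$$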

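For $\mathbf{t} \in \ell^2$ consider the series $\sum_{i\in I}\psi_i(st_i)$. Since $\mathbb{E}X_i = 0$, $\psi_i \ge 0$. It follows that
$\sum_{i\in I}\psi_i(st_i)$ is convergent or divergent to plus infinity. Since the $\psi_i$ are convex, this series
defines a convex function on the whole $\ell^2$. Let $\mathbf{t}^n \to \mathbf{t}^0$ in $\ell^2$. Hence for every $i \in I$,
$t^n_i \to t^0_i$. By superadditivity of the limit inferior and, next, by lower semicontinuity of
each $\psi_i$, we get

$$\liminf_{n\to\infty}\sum_{i\in I}\psi_i(st^n_i) \ge \sum_{i\in I}\liminf_{n\to\infty}\psi_i(st^n_i) \ge \sum_{i\in I}\psi_i(st^0_i).$$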


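Notice that both functions, $\psi^s$ and the series $\sum_{i\in I}\psi_i(st_i)$, are convex and lower
semicontinuous on $\ell^2$ and moreover coincide on $\ell_0$ (a dense subset of $\ell^2$). It follows
that these functions are equal on the whole $\ell^2$, i.e.

$$\psi^s(\mathbf{t}) = \sum_{i\in I}\psi_i(st_i)$$

for every $\mathbf{t}$ in $\ell^2$.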


□ 
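For finitely supported $\mathbf{t}$ the equality of Proposition 3.1 reduces to independence, and it can be checked exactly for symmetric Bernoulli variables by enumerating all sign patterns. The sketch below is an illustration only: the choice of Rademacher variables (for which $\psi_i(s) = \ln\cosh s$), the function names and the sample weights are assumptions made for the example.

```python
import itertools
import math

def cgf_by_enumeration(s, t):
    """ln E exp(s * sum_i t_i eps_i), averaging over all 2^n sign patterns."""
    total = sum(
        math.exp(s * sum(e * ti for e, ti in zip(signs, t)))
        for signs in itertools.product((-1, 1), repeat=len(t))
    )
    return math.log(total / 2 ** len(t))

def cgf_by_formula(s, t):
    """sum_i psi_i(s t_i) with psi_i(s) = ln cosh(s)."""
    return sum(math.log(math.cosh(s * ti)) for ti in t)

t = [1.0, 0.5, 0.25, 0.125]  # hypothetical finitely supported weights
for s in (-2.0, -0.3, 0.0, 0.7, 1.5):
    assert abs(cgf_by_enumeration(s, t) - cgf_by_formula(s, t)) < 1e-12
```

The enumeration costs $2^n$ terms, so it is practical only for short weight vectors; the passage to general $\mathbf{t} \in \ell^2$ is exactly what the density argument in the proof provides.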


Let us observe that for $s = 0$, $\psi^0 \equiv 0$ and its convex conjugate $(\psi^0)^*(\mathbf{a}) = 0$ for
$\mathbf{a} = 0$ and $\infty$ otherwise. From now on we assume that $s \ne 0$. A form of $(\psi^s)^*$ for $s \ne 0$
is described in the following:

Proposition 3.2. Under the assumptions of Proposition 3.1, the convex conjugate of
$\psi^s(\mathbf{t}) = \sum_{i\in I}\psi_i(st_i)$ defined on $\ell^2$ equals

$$(\psi^s)^*(\mathbf{a}) = \sum_{i\in I}\psi_i^*\Big(\frac{a_i}{s}\Big)$$

for $\mathbf{a} \in \ell^2$, where the $\psi_i^*$ are the Cramér transforms of the $X_i$.

Proof. The convex conjugate $(\psi^s)^*$ is convex and lower semicontinuous on $(\ell^2)^* \simeq \ell^2$.
Assume first that $I$ is a finite set. By virtue of the form of $\psi^s$, Lemma 1.2 and the
property (2), for $\mathbf{a}$ in $\ell^2$ we get

$$(\psi^s)^*(\mathbf{a}) = \sum_{i\in I}\psi_i^*\Big(\frac{a_i}{s}\Big).$$

Define now the functional $\sum_{i\in I}\psi_i^*(\frac{a_i}{s})$ on the whole space $\ell^2$. Since the $\psi_i^*$ are convex and lower
semicontinuous, this functional is convex and, similarly as in the case of $\sum_{i\in I}\psi_i(st_i)$, one
can show that it is also lower semicontinuous on $\ell^2$. Because this functional coincides
with $(\psi^s)^*$ on the dense subspace $\ell_0$, both functionals are equal on $\ell^2$. □

Let us emphasize that the functions $(\psi^s)^*$ are nonnegative and lower semicontinuous.
In large deviation theory such functions are called rate functions (good rate
functions when level sets are not only closed but also compact). In the main theorem
below we show that the contraction principle applied to the function $(\psi^1)^*$ by using the
functional $\langle\mathbf{t},\cdot\rangle$ over $\ell^2$ gives the Cramér transform of $X_{\mathbf{t}}$.

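Theorem 3.3. Let a sequence $(X_i)_{i\in I}$ of r.v.'s satisfy the assumptions of Proposition
3.1. Then for every $\mathbf{t} = (t_i)_{i\in I} \in \ell^2$ the Cramér transform $\psi_{\mathbf{t}}^* = \psi_{X_{\mathbf{t}}}^*$ of the random series
$X_{\mathbf{t}} = \sum_{i\in I} t_iX_i$ is given by the following variational formula:

$$\psi_{\mathbf{t}}^*(\alpha) = \inf_{\mathbf{b}\in\ell^2:\ \langle\mathbf{t},\mathbf{b}\rangle=\alpha}\ \sum_{i\in I}\psi_i^*(b_i)$$

for $\alpha \in \operatorname{int}\mathcal{D}(\psi_{\mathbf{t}}^*)$, where the $\psi_i^*$ are the Cramér transforms of the $X_i$.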

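Proof. The functional $\psi^s$ is convex and lower semicontinuous on $\ell^2$. By virtue of the
biconjugate theorem we have

$$\psi^s(\mathbf{t}) = \sup_{\mathbf{a}\in\ell^2}\{\langle\mathbf{t},\mathbf{a}\rangle - (\psi^s)^*(\mathbf{a})\},$$

where $(\psi^s)^*(\mathbf{a}) = \sum_{i\in I}\psi_i^*(\frac{a_i}{s})$ ($s \ne 0$). Substituting $\mathbf{a} = s\mathbf{b}$ we get

$$\sum_{i\in I}\psi_i^*\Big(\frac{a_i}{s}\Big) = \sum_{i\in I}\psi_i^*(b_i) = (\psi^1)^*(\mathbf{b})$$

and we can rewrite the above as follows:

$$\psi^s(\mathbf{t}) = \sup_{\mathbf{b}\in\ell^2}\{s\langle\mathbf{t},\mathbf{b}\rangle - (\psi^1)^*(\mathbf{b})\}. \tag{5}$$

Let us return to the function $\psi_{\mathbf{t}}$, which is convex and lower semicontinuous on $\mathbb{R}$.
By the biconjugate theorem we have

$$\psi_{\mathbf{t}}(s) = \sup_{\alpha\in\mathbb{R}}\{s\alpha - \psi_{\mathbf{t}}^*(\alpha)\}.$$

Let us recall that $\psi_{\mathbf{t}}(s) = \psi^s(\mathbf{t})$. If we split the supremum in (5), as in the Model Example,
into two parts then we get

$$\psi_{\mathbf{t}}(s) = \psi^s(\mathbf{t}) = \sup_{\alpha\in\mathbb{R}}\ \sup_{\mathbf{b}\in\ell^2:\ \langle\mathbf{t},\mathbf{b}\rangle=\alpha}\{s\langle\mathbf{t},\mathbf{b}\rangle - (\psi^1)^*(\mathbf{b})\} = \sup_{\alpha\in\mathbb{R}}\Big\{s\alpha - \inf_{\mathbf{b}\in\ell^2:\ \langle\mathbf{t},\mathbf{b}\rangle=\alpha}(\psi^1)^*(\mathbf{b})\Big\}.$$

Let $\varphi_{\mathbf{t}}$ denote the function $\alpha \mapsto \inf_{\mathbf{b}\in\ell^2:\ \langle\mathbf{t},\mathbf{b}\rangle=\alpha}(\psi^1)^*(\mathbf{b})$. The functional $(\psi^1)^*$ is convex
on $\ell^2$. Convexity is preserved under contraction by a linear transformation (see [4,
Th. III.32]), so $\varphi_{\mathbf{t}}$ is convex. The identity above says that $\psi_{\mathbf{t}} = \varphi_{\mathbf{t}}^*$, hence $\psi_{\mathbf{t}}^* = \varphi_{\mathbf{t}}^{**}$.
This suffices to conclude that $\varphi_{\mathbf{t}}$ and $\psi_{\mathbf{t}}^*$ coincide on $\operatorname{int}\mathcal{D}(\psi_{\mathbf{t}}^*)$ (both take the value
$\infty$ on the complement of $\operatorname{cl}\mathcal{D}(\psi_{\mathbf{t}}^*)$), that is

$$\psi_{\mathbf{t}}^*(\alpha) = \inf_{\mathbf{b}\in\ell^2:\ \langle\mathbf{t},\mathbf{b}\rangle=\alpha}(\psi^1)^*(\mathbf{b})$$

for $\alpha \in \operatorname{int}\mathcal{D}(\psi_{\mathbf{t}}^*)$, where $(\psi^1)^*(\mathbf{b}) = \sum_{i\in I}\psi_i^*(b_i)$. □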
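The variational formula of Theorem 3.3 can be illustrated numerically in the simplest nontrivial case of two symmetric Bernoulli variables. The sketch below rests on several assumptions that are not part of the proof: the weights $\mathbf{t} = (1, 0.5)$ and the level $\alpha = 0.5$ are arbitrary sample values, the Cramér transform of a single symmetric Bernoulli variable is taken in its known closed form $\frac{1}{2}[(1+b)\ln(1+b) + (1-b)\ln(1-b)]$ on $[-1,1]$, and both the supremum and the infimum are approximated by grid searches with hypothetical helper names.

```python
import math

def rademacher_rate(b):
    """Cramer transform of a symmetric Bernoulli variable (0 * ln 0 taken as 0)."""
    if abs(b) > 1.0:
        return math.inf
    xlogx = lambda x: x * math.log(x) if x > 0.0 else 0.0
    return 0.5 * (xlogx(1.0 + b) + xlogx(1.0 - b))

def cramer_direct(alpha, t, s_max=20.0, steps=40001):
    """sup_s {s*alpha - sum_i ln cosh(s t_i)} by grid search on [-s_max, s_max]."""
    h = 2.0 * s_max / (steps - 1)
    best = -math.inf
    for k in range(steps):
        s = -s_max + k * h
        best = max(best, s * alpha - sum(math.log(math.cosh(s * ti)) for ti in t))
    return best

def cramer_variational(alpha, t, steps=20001):
    """inf of rate(b1) + rate(b2) over the line t1*b1 + t2*b2 = alpha."""
    t1, t2 = t
    h = 2.0 / (steps - 1)
    best = math.inf
    for k in range(steps):
        b1 = -1.0 + k * h
        b2 = (alpha - t1 * b1) / t2  # enforce the linear constraint
        best = min(best, rademacher_rate(b1) + rademacher_rate(b2))
    return best

t = (1.0, 0.5)
alpha = 0.5
# both sides of the variational formula agree up to discretization error
assert abs(cramer_direct(alpha, t) - cramer_variational(alpha, t)) < 1e-5
```

In two dimensions the constraint set is a line, so a one-dimensional search over $b_1$ suffices; for longer weight vectors a constrained convex solver would be the natural replacement for the grid.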

Remark 3.4. We cannot prove in general that $\varphi_{\mathbf{t}}$ is lower semicontinuous. Sometimes
this is obvious, for instance when the effective domain of $\psi_{\mathbf{t}}^*$ is an open subset of $\mathbb{R}$ (or even
the whole $\mathbb{R}$; see the Model Example and Example 3.5). Under the assumption that $(\psi^1)^*$ is a
good rate function with respect to the weak* topology we can state lower semicontinuity
of $\varphi_{\mathbf{t}}$ (see Example 3.6).

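Example 3.5. Let $X$ be a r.v. with the Laplace density $\frac{1}{2}e^{-|x|}$. Its moment generating
function is $\mathbb{E}e^{sX} = \frac{1}{1-s^2}$ for $|s| < 1$ and $\infty$ otherwise. Let us observe that

$$\frac{1}{1-s^2} = \sum_{n=0}^{\infty}s^{2n} \ge \sum_{n=0}^{\infty}\frac{s^{2n}}{n!\,2^n} = e^{s^2/2} = \mathbb{E}e^{sg},$$

where $g$ is standard normal distributed. Thus $\psi_X \ge \psi_g$ and, since the Cramér transform
is order-reversing, we have

$$\psi_X^*(\alpha) \le \psi_g^*(\alpha) = \frac{\alpha^2}{2}. \tag{6}$$

Moreover, by using the classical Legendre transform we can calculate an explicit form
of $\psi_X^*$ and get

$$\psi_X^*(\alpha) = \frac{\alpha^2}{\sqrt{1+\alpha^2}+1} + \ln\frac{2}{\sqrt{1+\alpha^2}+1}.$$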

Consider a sequence $(X_i)_{i\in I}$ of independent r.v.'s with the same Laplace distribution.
By (6) we have

$$(\psi^1)^*(\mathbf{b}) = \sum_{i\in I}\psi_X^*(b_i) \le \frac{1}{2}\|\mathbf{b}\|^2.$$

It means that $(\psi^1)^*$ takes finite values on the whole space $\ell^2$ and for every $\alpha \in \mathbb{R}$ there
is a finite infimum

$$\inf_{\mathbf{b}\in\ell^2:\ \langle\mathbf{t},\mathbf{b}\rangle=\alpha}\ \sum_{i\in I}\psi_X^*(b_i),$$

that is, the finite value of $\psi_{\mathbf{t}}^*$ at $\alpha$.
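The explicit Cramér transform of the Laplace distribution and the comparison (6) can be cross-checked numerically. The sketch below is a hedged illustration with hypothetical function names: it compares the closed form $\frac{\alpha^2}{\sqrt{1+\alpha^2}+1} + \ln\frac{2}{\sqrt{1+\alpha^2}+1}$ with a grid approximation of $\sup_{|s|<1}\{s\alpha + \ln(1-s^2)\}$, using $\psi_X(s) = -\ln(1-s^2)$.

```python
import math

def laplace_rate_closed_form(alpha):
    """Closed form of the Cramer transform of the Laplace(1) distribution."""
    r = math.sqrt(1.0 + alpha * alpha)
    return alpha * alpha / (r + 1.0) + math.log(2.0 / (r + 1.0))

def laplace_rate_numeric(alpha, steps=20001):
    """sup over |s| < 1 of s*alpha + ln(1 - s^2), by grid search on the open interval."""
    h = 2.0 / (steps + 1)
    best = -math.inf
    for k in range(1, steps + 1):
        s = -1.0 + k * h  # strictly inside (-1, 1)
        best = max(best, s * alpha + math.log(1.0 - s * s))
    return best

for alpha in (0.0, 0.5, 1.0, 2.0, 5.0):
    closed = laplace_rate_closed_form(alpha)
    assert abs(closed - laplace_rate_numeric(alpha)) < 1e-6
    assert closed <= alpha * alpha / 2.0 + 1e-12  # inequality (6)
```

The grid avoids the endpoints $s = \pm 1$, where the moment generating function is infinite; the tolerance reflects the discretization error for the sample values of $\alpha$ only.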


In the paper [7] one can find an example of variational formulas for the Cramér
transform of series of weighted symmetric Bernoulli random variables, but with coefficients
belonging to the space $\ell^1$. In the context of our Theorem 3.3 we recall the main
result of that paper, but now with coefficients in the bigger space $\ell^2$.

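Example 3.6. If $X$ is a symmetric Bernoulli r.v., i.e. $\Pr(X = \pm 1) = \frac{1}{2}$, then $\mathbb{E}e^{sX} = \cosh s$.
By power series expansions one has $\cosh s \le e^{s^2/2}$. In this example, conversely
to the previous one, we get that $\psi_X(s) \le \psi_g(s) = \frac{s^2}{2}$ and

$$\psi_X^*(\alpha) \ge \psi_g^*(\alpha) = \frac{\alpha^2}{2}. \tag{7}$$

One can check that

$$\psi_X^*(\alpha) = \frac{1}{2}\big[(1+\alpha)\ln(1+\alpha) + (1-\alpha)\ln(1-\alpha)\big]$$

for $|\alpha| \le 1$ and $\infty$ otherwise; we take $0\ln 0 = 0$. Note that $\psi_X^*(\pm 1) = \ln 2$.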
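The closed form of the Bernoulli Cramér transform, its boundary value $\ln 2$ and the inequality (7) can be checked numerically in the same spirit. This is a minimal sketch with hypothetical function names; the supremum $\sup_s\{s\alpha - \ln\cosh s\}$ is approximated on a bounded grid, which is adequate only for $\alpha$ not too close to $\pm 1$.

```python
import math

def bernoulli_rate_closed_form(alpha):
    """(1/2)[(1+a)ln(1+a) + (1-a)ln(1-a)] on [-1, 1], infinity outside."""
    if abs(alpha) > 1.0:
        return math.inf
    xlogx = lambda x: x * math.log(x) if x > 0.0 else 0.0  # convention 0 * ln 0 = 0
    return 0.5 * (xlogx(1.0 + alpha) + xlogx(1.0 - alpha))

def bernoulli_rate_numeric(alpha, s_max=20.0, steps=40001):
    """sup_s {s*alpha - ln cosh(s)} by grid search on [-s_max, s_max]."""
    h = 2.0 * s_max / (steps - 1)
    best = -math.inf
    for k in range(steps):
        s = -s_max + k * h
        best = max(best, s * alpha - math.log(math.cosh(s)))
    return best

for alpha in (-0.9, -0.3, 0.0, 0.5, 0.9):
    closed = bernoulli_rate_closed_form(alpha)
    assert abs(closed - bernoulli_rate_numeric(alpha)) < 1e-6
    assert closed >= alpha * alpha / 2.0 - 1e-12  # inequality (7)

assert abs(bernoulli_rate_closed_form(1.0) - math.log(2.0)) < 1e-15
```

At $\alpha = \pm 1$ the supremum is approached only as $s \to \pm\infty$, which is why the boundary value is checked through the closed form rather than through the grid search.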

For a sequence of independent Bernoulli r.v.'s, by the above inequality, we have

$$(\psi^1)^*(\mathbf{b}) \ge \frac{1}{2}\|\mathbf{b}\|^2.$$

Since $(\psi^1)^*$ is lower semicontinuous in the weak* topology, the level sets $\{\mathbf{b} \in \ell^2 : (\psi^1)^*(\mathbf{b}) \le c\}$
are weak* closed; for $c < 0$ they are empty. By the above, for each $c > 0$ the level set
of $(\psi^1)^*$ is contained in the closed ball $\bar{B}(0;\sqrt{2c}) = \{\mathbf{b} \in \ell^2 : \|\mathbf{b}\| \le \sqrt{2c}\}$. By virtue of
the Banach-Alaoglu theorem (see [5, Th. 3.15]) the balls $\bar{B}(0;\sqrt{2c})$ are weak* compact (they are
polar sets of the balls $B(0;1/\sqrt{2c})$). It follows that the level sets $\{\mathbf{b} \in \ell^2 : (\psi^1)^*(\mathbf{b}) \le c\}$
are weak* compact as closed subsets of compact sets. It means that in this topology
$(\psi^1)^*$ is a good rate function and by the contraction principle (see [2, Th. 4.2.1]) $\varphi_{\mathbf{t}}$ is
also a good rate function on $\mathbb{R}$; in particular, it is lower semicontinuous.

Let us emphasize that if we know that $(\psi^1)^*$ is a good rate function then we can
prove lower semicontinuity of $\varphi_{\mathbf{t}}$.


Remark 3.7. Finally, let us emphasize once again that only in Gaussian cases can we
solve the variational formula for the exponential rate functions in tail estimates of
series of independent r.v.'s. For non-Gaussian processes we are left only with these
variational principles.





References 


[1] V. Barbu, T. Precupanu, Convexity and Optimization in Banach Spaces, 4th ed.,
Springer Monographs in Mathematics, Springer, Dordrecht (2012).

[2] A. Dembo, O. Zeitouni, Large Deviations Techniques and Applications, corrected
reprint of the second (1998) edition, Stochastic Modeling and Applied Probability,
38, Springer-Verlag, Berlin (2010).

[3] I. Ekeland, R. Temam, Convex Analysis and Variational Problems, translated from
the French, corrected reprint of the 1976 English edition, Classics in Applied
Mathematics, 28, Society for Industrial and Applied Mathematics (SIAM), Philadelphia,
PA (1999).

[4] F. den Hollander, Large Deviations, Fields Institute Monographs, 14, American
Mathematical Society, Providence, RI (2000).

[5] W. Rudin, Functional Analysis, 2nd ed., International Series in Pure and Applied
Mathematics (1991).

[6] M. Talagrand, Majorizing measures: the generic chaining, Ann. Probab. 24,
1049-1103 (1996).

[7] K. Zajkowski, Cramér transform of Rademacher series, Positivity (2014),
doi: 10.1007/s11117-014-0313-5.





